tsflex: Flexible time series processing & feature extraction
نویسندگان
چکیده
Time series processing and feature extraction are crucial time-intensive steps in conventional machine learning pipelines. Existing packages limited their applicability, as they cannot cope with irregularly-sampled or asynchronous data make strong assumptions about the format. Moreover, these do not focus on execution speed memory efficiency, resulting considerable overhead. We present tsflex , a Python toolkit for time that focuses performance flexibility, enabling broad applicability. This leverages window-stride arguments of same type sequence-index, maintains sequence-index through all operations. is flexible it supports (1) multivariate series, (2) multiple configurations, (3) integrates functions from other packages, while (4) making no sampling regularity, alignment, type. Other functionalities include multiprocessing, detailed logging, chunking sequences, serialization. Benchmarks show faster more memory-efficient compared to similar being permissive its utilization.
منابع مشابه
AMP: a new time-frequency feature extraction method for intermittent time-series data
The characterisation of time-series data via their most salient features is extremely important in a range of machine learning task, not least of all with regards to classification and clustering. While there exist many feature extraction techniques suitable for non-intermittent time-series data, these approaches are not always appropriate for intermittent timeseries data, where intermittency i...
متن کاملFeature eXtraction from sparse time series data
We present a computational methodology for qualitative analysis of sparse and noisy time series. Information about the changes of the signal level within a time series and the number of distinguishable signal levels is extracted and condensed into a pattern string. The qualitative analysis of a time series can be done at several levels of detail to generate pattern strings that encode the seque...
متن کاملMulti-dimensional sparse time series: feature extraction
We show an analysis of multi-dimensional time series via entropy and statistical linguistic techniques. We define three markers encoding the behavior of the series, after it has been translated into a multi-dimensional symbolic sequence. The leading component and the trend of the series with respect to a mobile window analysis result from the entropy analysis and label the dynamical evolution o...
متن کاملDynamical Feature Extraction from Brain Activity Time Series
Neurologists typically study the brain activity through acquired biomarker signals such as Electroencephalograms (EEGs) which have been widely used to capture the interactions between neurons or groups of neurons. Detecting and identifying the abnormal patterns through visual inspection of EEG signals are extremely challenging and require constant attention for well trained and experienced spec...
متن کاملA Time Series Forest for Classification and Feature Extraction
A tree-ensemble method, referred to as time series forest (TSF), is proposed for time series classification. TSF employs a combination of entropy gain and a distance measure, referred to as the Entrance (entropy and distance) gain, for evaluating the splits. Experimental studies show that the Entrance gain improves the accuracy of TSF. TSF randomly samples features at each tree node and has com...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: SoftwareX
سال: 2022
ISSN: ['2352-7110']
DOI: https://doi.org/10.1016/j.softx.2021.100971